List of Flash News about Humanity’s Last Exam
Time | Details |
---|---|
2025-03-25 17:06 |
Gemini 2.5 Pro Experimental Achieves Leading Scores in Math and Science Benchmarks
According to Google DeepMind, Gemini 2.5 Pro Experimental has achieved leading scores in math and science benchmarks, specifically GPQA and AIME 2025, without test-time optimizations. This indicates its robust performance capabilities. Additionally, it scored 18.8% on Humanity’s Last Exam, showcasing its advanced reasoning and knowledge capabilities. |